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Abstract. If w = ua for a G X = {1,2} and u 6 S*, then w is said to be a 
simple right extension of ii and denoted by u -< w. Let A: be a positive integer 
and P k (e) denote the set of all C°°-words of height k. Set u\, 11,2, • • • ,u m E 
P k (e), if u\ -< U2 ~< ■ ■ ■ ~< u m and there is no element v of P k (e) such that 
v -< u\ or n m -< v, then iti -< 7x2 -4 ■ ■ ■ -< u m is said to be a maximal right 
smooth extension (MRSE) chains of height k. In this paper, we show that 
MRSE chains of height k constitutes a partition of smooth words of height k 
and give the formula of the number of MRSE chains of height k for each positive 
integer k. Moreover, since there exist the minimal height hi and maximal height 
/12 of smooth words of length n for each positive integer n, we find that MRSE 
chains of heights h\ — l and h%+l are good candidates to be used to establish the 
lower and upper bounds of the number of smooth words of length n respectively, 
the method of which is simpler and more intuitionistic than the previous ones. 
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1. Introduction 



Let E = {1,2}, E* denotes the free monoid over E with e as the empty word. If 
w = W1W2 ■ ■ ■ w n , Wi G E for i — 1, 2, • • • , n, then n is called the length of the word w 
and denoted by For % — 1, 2, let \w\i be the number of z which occurs in to, then 
= |iw|i + |to| 2 . 

Given a word w G E*, a factor or subword u of w is a word M G E* such that 
u> = im?/ for x,y G E*, if x = e, then tt is said to be a prefix of w. A run or 
block is a maximum factor of consecutive identical letters.. The complement of u = 
U\U2 ■ ■ ■ u n G E* is the word u = UiU 2 • • • u n , where 1 = 2,2 = 1. 

The Kolakoski sequence K which Kolakoski introduced in [13], is the infinite 
sequence over the alphabet E 

K = ^^^^^^^2^^1^^22^^1^^22^^^^2^^^^22^^^ 

122112122122 ~=K 

which starts with 1 and equals the sequence defined by its run lengths. 

I would like to thank Prof. Jeffrey O. Shallit for introducing me the Kolakoski 
sequence K and raising eight open questions on it in personal communications (Feb. 
15, 1990), the fourth and eighth problems of them are respectively as follows: 

(1) Prove or disprove: \K{\\ ~ \Ki\2, which is almost equivalent to Keane's ques- 
tion. 

(2) Prove or disprove: \Ki\ ~ a(3/2) J (This would imply (1)), where a seems to 
be about 0.873. Does a = (3 + V5)/6 ? 

where K = 2 and define K n+ i as the string of 1' and 2' obtained by using the elements 
of K n as replication factors for the appropriate prefix of the infinite sequence 1212 ■•• . 

The intriguing Kolakoski sequence K has received a remarkable attention [1, 3, 
5, 11, 15, 16]. For exploring two unsolved problems, both wether K is recurrent and 
whether K is invariant under complement, raised by Kimberling in [12], Dekking pro- 
posed the notion of C°°-word in [6]. Chvatal in [4] obtained that the letter frequencies 
of C°°-words are between 0.499162 and 0.500838. 

We say that a finite word w G E* in which neither 111 or 222 occurs is differen- 
tiable, and its derivative, denoted by D(w), is the word whose jth symbol equals the 
length of the jth run of w, discarding the first and/or the last run if it has length 
one. 

If a word w is arbitrarily often different iable, then w is said to be a C°°-word (or 
smooth word) and the set of all C°°-word is denoted by C°°. 

A word v such that D(v) = w is said to be a primitive of w. Thus 11, 22, 211, 
112, 221, 122, 2112, 1221 are the primitives of 2. It is easy to see that for any word 



w G there are at most 8 primitives and the difference of lengths of two primitives 
of w is at most 2. 

The height of a nonempty smooth word w is the smallest integer k such that 
D k (w) = e and the height of the empty word e is zero. We write ht(w) for the height 
of w. For example, for the smooth word w = 12212212, D 4 (w) = e, so ht{w)=A. 

2. Maximal right smooth extension chains 

Let TV be the set of all positive integers and P k (e) denote the set of all smooth words 
of height k for k G /V, then 

P{e) = {1,2,12,21}, (1) 
P 2 (e) = {121,212,11,22,211,122,112,221,2112,1221,1211,12112, 

2122, 21221, 1121, 21121, 2212, 12212}. (2) 

Definition 1. Let w, u, v G S* if w = uv, then w is said to be a right extension of 
u. Especially, if v = a G X, then w is said to be a simple right extension of u, and is 
denoted by u ~< w. 

Definition 2. Let ui, u 2 , ■ ■ ■ , u m G P k (e), where k G Af. 

iti -< u 2 -< < u m , (3) 

and there is no element v of P k (e) such that 

v -< u\ or u m -< v, 

then (3) is said to be a maximal right smooth extension (MRSE) chain of the height k. 
Moreover, u\ and u m are respectively called the first and last members of the MRSE 
chain (3). 

Let H k denote the set of all MRSE chains of the height k. For £ G H k , £ — U\ -< 
«2 -< • ■ ■ -< u m , the complement of £ is u\ -< u% -< ■ ■ ■ -< u m , and is denoted by £. 
It is clear that £ is also a MRSE chain of the height k. In addition, for A C H k , 
■\ {C: £G A}. 

Definition 3. For £ G H k+1 , £ = u\ -< Ui -< ■ ■ • -< u m , where k G Af. If there is an 
element r\ — V\ -< v 2 -< ■ ■ ■ -< v n G H k such that u±, u 2 , ■ ■ ■ , u m are all the primitives 
of v i, v 2i • " " ,fn, then £ is said to be a primitive ofrj. 



For example, £ = 121 -< 1211 -< 12112 G H 2 is a primitive of rj = 1 -< 12 G H 1 , 
£ = 212 -< 2122 -« 21221 G H 2 . 

For a set A, let \A\ denote the cardinal number of A. Next we establish the 
formula of the number of the members of H k . For this reason, let 

H k = {£ G H k : £ = Mi -< u 2 -<•••-< u m and first{ui) = 1}; (4) 
H 2 = {£ G : £ = ui -< u 2 -<•■■-< n m and first(ui) = 2}. (5) 

It immediately follows that 





Tjk . 


(6) 


H k 




(7) 


\ H i\ 


= \H k 2 \. 


(8) 



From (1) and (2) we have 

H 1 = {1 ^ 12, 2 ^ 21}; (9) 

Hi = {1 ^ 12}; 

H\ = {2^21}; 

H 2 = {121 -< 1211 -< 12112, 212 -< 2122 -< 21221, 11 -< 112 -< 1121, 

22 -< 221 -< 2212, 211 -< 2112 -< 21121, 122 -< 1221 -< 12212}; (10) 

H 2 = {121 -< 1211 -< 12112, 11 -< 112 -< 1121, 122 -< 1221 -< 12212}; 

H 2 = {212 ^ 2122 ^ 21221, 22 ^ 221 ^ 2212, 211 ^ 2112 ^ 21121}. 

Thus from (9) and (10), we see that every MRSE chain of height k is uniquely 
determined by its first member u\ and each member of P k {e) exactly belongs to one 
MRSE chain of height k for k = 1, 2 and 

|if 2 | = 31^1 (11) 

Actually, the above result holds for every k G N . 

Theorem 4. H k is stated as above. Then each member of P k (e) exactly belongs to 
one MRSE chain of height k, that is, H k gives a partition of the smooth words of 
height k and 

\H k \ = 2-3 fc " 1 for alike AT. (12) 

Proof. We proceed by induction on k. From (11) it follows that (12) holds for 
k = 1, 2. Assume that (12) holds for k = n — 1 > 1. 



Now we consider the case for k = n. Since for each rj = U\ -< u-i -< • • • -< u m G 
H™~ , from the definition 2 and (4), we see that first(u\) = first(u2) = ■■■ = 
first(u m ) = 1, and u i+ i = UiOt where % = 1, 2, • • • , m — 1, a = 1,2. Thus if a = 1 
then the two primitives p(u i+ i) of are 

p(u i+1 ) = /3A^O i+1 )7 

— p( u i)l, where j3, 7 £ E, 

so p(«i) -< 

If a = 2 then the four primitives p t (ttj+i) of ttj+i are 
Pt(«<+i) = /3A / ^ 1 (n i+1 )7* 

= p(ui)77 , where (3 = 1,2, t — 0,1, 

hence p(ttj) -< po( M i+i) ~< Therefore, 77 has exactly two primitives and the 

primitives of ui, 1*2, ■ • • , and u m all occur in the two primitives of 77. 

For example, r] = 121 -< 1211 -< 12112 G has exactly two primitives: 

H = 121121 -< 1211212 -< 12112122 -< 121121221 and ft. 

Analogously, we can see that each member r] of H%~ has exactly four primitives 
and the primitives of u±, u-i, ■ ■ ■ , and u m all occur in the four primitives of 77. 
For example, 77 = 212 -< 2122 -< 21221 G H\ has exactly four primitives: 
fi = 22122 -< 221221 -< 2212211 -< 22122112 -< 221221121; 
£2 = 122122 -< 1221221 -< 12212211 -< 122122112 -< 221221121 and 6- 
Thus, by the induction hypothesis, it follows from (7) and (8) that 

\H n \ = \H?\ + \H$\ 

rn—l I 1 a I zjn—1 1 



2 ■ \H?~ L \ + 4- \H r 2 



3 • d^r 1 ! + i^r 11 



3- liJ"" 1 ! 
2-3 n ~ 1 . □ 



3. The number of smooth words of length n 



Let 7(71) denote the number of smooth words of length n and pir(n) the number of 
subwords of length n which occur in K. 



Dekking in [6] proved that there is a suitable positive constant c such that 
c • n 2,15 < 7(71) < n 7 ' 2 and brought forward the conjecture that there is a suitable 
positive constant c satisfying px(n) ~ c- n 9 (n — > 00), where g = (log 3)/ log(3/2). 

Recall from [18] that a C^-word w is left doubly extendable (LDE) if both lw and 
2w are C°°, and a C^-word u> is fully extendable (FE) if liol, lw;2, 2wl, and 2u;2 all 
are C^-words. For each nonnegative integer k, let A(k) be the minimum length and 
B(k) the maximum length of an FE word of height k. 

Weakley in [18] proved that there are positive constants c\ and c 2 such that for 
each n satisfying B(k — 1) + 1 < n < A(k) + 1 for some k, c\ ■ n q < 7(71) < c 2 ■ n q . 

It is a pity that we don't know how many positive integers n fulfil the conditions 
required. Set 7'(ra) = ■y(n + 1) — 7(71), Weakley in [18] gave 

n-l 

T (n) = 7(0) + i («) /° r " > 2. (13) 

8=0 

Let F(n) denote the number of LDE-words of height n, Shen and Huang in [14, 
Proposition 3.2] established 

F(n) = 4 • ?> n ^ 1 for each positive integer n. (14) 

Huang and Weakley in [9] combined (13) with (14) to show that 

Theorem 5 ([9] Theorem 4). Let £ be a positive real number and N a positive integer 
such that for all LDE words u with \u\ > N we have |w|2/H > (1/2) — £. Then there 
are positive constants c\, C2 such that for all positive integers n, 

log 3 log 3 

d ■ n 1 °s« 3 / 2 )+s+( 2 / JV )) < 7(71) < c 2 • n lo s(( 3 / 2 )-«) . 

Let 7a,b(7i) denote the number of smooth words of length n over 2-letter alphabet 
{a, b} for positive integers a < b, Huang in [10] obtained 

Theorem 6. For any positive real number £ and positive integer uq satisfying \u\t,/\u\ > 
£ for every LFE word u with \u\ > n , there exist two suitable constants c\ and c 2 
such that 

log(2b-l) log(2fe-l) 

c x • n lo s( 1 +( a + i, - 2 )( 1 '«)) < 7a,&(7i) < c 2 • n 1 °s( 1 +( a + b - 2 )0 

for every positive integer n. 

Since there are the minimum height hi(n) and maximum height h 2 (n) of smooth 
words of length n for each positive integer n, so the lengths of smooth words of the 
height hi(n) — 1 must be less than n and the length of smooth words of the height 
h,2{n) + 1 must be larger than n. Now we are in a position to use the number of MRSE 



chains of the suitable height k to bound the number of smooth words of length n, 
which is simpler than the ones used in [9, 10]. The estimates for the heights of smooth 
words of length n are borrowed from [10] in the following proof. 

Theorem 7. For any positive number 9 and n satisfying (li^/H > 9 for \u\ > n , 
there are suitable positive constant ci,c 2 such that 

log 3 log 3 

C\ ■ n lo s( 2 ' e ) < 7(71) < C2 • n 1 °s( 1 + 9 ) for any positive integer n. 
Proof. It is obvious that 

\w\ < \D{w) \ + I .D (it;) 1 2 for each smooth word w. (15) 
First, since |it|2/|it| > 9 for \u\ > no, from (15) one has 

\w\ > (1 + 9)\D(w)\ for \D(w)\ 2 /\D(w)\ > 9, 
which implies 

\D(w) I < a\w\ for \w\ > N , (16) 

where N is a suitable fixed positive integer such that \D(w) \ > n as soon as \w\ > N , 
a = 1/(1 + 9). Since there are finitely many smooth words of length less than No, 
from (16) we see that there exists a suitable nonnegative integer / such that 

|-D(u;)| < a|i<;| + / for each smooth word. (17) 

Let ko be the least integer such that the length of smooth words of height ko is 
larger than and r be the smallest length of smooth words of height ko. Let k be 
the height of the smooth words w such that ht(w) > ko, then ht(D k ~ k ° (w)) = k . So, 
from (17), we get 

r < \D k - ko (w)\ 

< a\D k - k °-\w)\+l 

< a 2 \D k ~ ko ~ 2 (w)\+al + l 

< a k ~ k °\w\ + a k ~ k °l H Va 2 l + al + l 

I 



< a k ko \w\ + 



1 — a 



Thus 



'l/a) k k ° < where X = r 



X 1 — a ' 



which means 



i » / \ , ^ log | u; | L , log A 
= k < - — — — + k 



log(l/a) log(l/a) 

Since there are only finitely many smooth words satisfying ht(w) < k , so there is a 
suitable constant t 2 such that 

log \ w\ 

ht(w) < - — ; — r-x + t 2 for each smooth word. (18) 
log(l/a) 

Therefore, the maximal height h 2 (n) of all smooth words of length n satisfies 
lose n 

M »> 5 MlW^ (19) 

Put k = h2(n) + 1, then the length of every smooth word of height k is greater 
than n, so each smooth word of length n can be right extended to get a MRSE chain 
of height k, which suggests 7(71) < \H k \. Consequently, from (12) and (19) it follows 
the desired upper bound of j(n). 

Second, since the complement of any smooth word is a smooth word of the same 
length, the theorem's hypothesis implies that \D(w)\i/\D(w)\ > 8, so \D(w)\2/\D(w)\ < 
1 — 9. From (15) it follows that 

\w\ < 0\D(w)\ + q for each C°°-wordw, (20) 

where /3 = 2 — 9, q is a suitable positive constant. Thus 

\w\ < ^\D k ~\w)\ + < 2/? fc " 1 + = (2 + ^y)/^ 1 = tp~\ 

where t = 2 + q/((3 — 1), k is the height of \w\. Wherefore, the length \w\ of a smooth 
word w with height k is less than t/3 fc_1 and k — 1 > (log \ w\ — log t)/ log f3. Hence, 
the smallest height h\ (n) of smooth words of length n meets 

logn logt 

hl n > ] — M — a\ + *i' where tl = 1 ~ 1 a- 21 

log(2 - 9) log p 

Then the length of all smooth words with height m = hi(n) — 1 is less than 
n, which means that the length of the last member last (£) is less than n for each 
^ G H m . Since each smooth words of length no more than n — 1 can be extended 
right to a smooth word of length n, we see 7(71) > \H m \. Herewith, from (12) and 
(21) we get the desired lower bound of 7(71). □ 



4. Concluding remarks 



Let a and b be positive integers of different parities and a < b. Lately, Sing in [15] 
conjectured: 

There are positive constants ci, c 2 such that 

ci • n < 7a,b(n) < c 2 • n , where (5 = - — - — . 

log((a + 6)/2) 

Theorem 6 means Sing's conjecture should be revised to be of the following form 

o / n o lo g( 2& - 1) 

ci • n < 7a,&( n ) < c 2-n, where = - — - — . 

log((a + 6)/2) 

For 2- letter alphabet S = {a,b} with a < b, let P^{e) denote the set of smooth 
words of height k for j G A/". For a G S, set 

£ i = a 1 -< a % a -< a l a 2 -<••■-< a'a^ 1 for 1 < i < b - 1. 

and 

^ 1 = {^l^ = ^ore t ,2 = l,2,--- ,6-1}. 

Let if 2 denote the set of primitives of the members in H 1 , then it is easy to see H 2 
constitutes a partition of P 2 {e). So continue, we can define the set H k for each k G M 
and H k constitutes a partition of P h (e). Using the method similar to Theorem 7, we 
could establish the corresponding result to Theorem 6. 
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